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GENE EXPRESSION IN MONOCYTES AND MACROPHAGES 

5 The present invention relates to the regulatory nucleotide sequences 
associated with the CD68 gene. 

Background to the Invention 

10 Lqcm$ Cpntrpl Rggiong 

Mammalian gene expression is regulated through cis- linked 
DNA sequences. Promoter sequences lie immediately 5' of the gene's 
transcription initiation site and enhancer sequences can lie within or close 
to the gene which they regulate. In 1987 it was shown that DNA 

IS sequences found within 70 kilobases (kb) 5* of the human, giobin gene 
on human chromosome 1 1 were able to effect high-level, copy number 
dependent expression of the human p-globin gene in erythroid cells of 
transgenic mice (1) Grosveld, F., etai Cell, 1987, 51, 975-985. These 
DNA sequences which play a key role in the in vivo regulation of giobin 

20 gene expression were termed Dominant Control Sequences (OCR) - later 
renamed Locus Control Regions (LCRs). Locus Control Regions have 
been described for other red cell genes such as the genes of the human a- 
globin locus (2) Greaves. D.R., etaf Cell, 1989, 56, 979-986 and LCRs 
have been described which direct high level gene expression in other cell 

25 types. Examples include the human CD2 gene which has a T-cell specific 
LCR and the 3' end of the Ca immunoglobulin heavy chain locus which 
directs high level expression in B-cells (3) Madisen,,,L. and Groudine, M. 
Genes and Development, 1994, 8, 2212-2226. 

30 The CD68 gen is expressed in all macrophage cells. The specificity of 
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expression in vivo is high. The origin of this specificity has been 
investigated and surprisingly it has been found that small regions of the 
CD68 gene are responsible. 

5 Summary of Invention 

The present invention therefore provides a cloned or isolated 
polynucleotide having the function of a transcriptional regulatory sequence 
(trs) and comprising: 
10 (a) a polynucleotide fragment having at least 70% identity 

to the polynucleotide of Seq ID No. 2; 

(b) a poiynucleotide which is complementary to the 
polynucleotide of (a); or 

(c) a polynucleotide comprising at least 15 sequential 
15 bases of the polynucleotide of (a) or (b). 

Preferably the poiynucleotide fragment has at least 80% 
identity to the polynucleotide of Seq ID No. 2, more preferably at least 90% 
identity to the polynucleotide of Seq ID No. 2. Most preferably the isolated 
20 polynucleotide according to the invention comprises the polynucleotide of 
Seq ID No.2. 

The present invention further provides an isolated 
polynucleotide comprising the transcriptional regulatory sequence of CD68. 

The present invention additionally provides an isolated 
25 polynucleotide comprising the transcriptional regulatory sequence of CD68 
and a polynucleotide operatively linked thereto encoding a heterologous 
polypeptide. 

The present invention also provides a vector comprising a 
polynucleotide as defined herein and a host cell comprising the vector 
30 The pres nt invention further provides a process for 
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producing a polypeptide which process comprises transforming or 
transfecting a cell with a vector as defined herein such that the cell 
expresses the polypeptide encoded. 

In a further aspect this invention results from the discovery of DNA 
5 sequences exhibiting the properties of a locus control region associated 
with the CD68 gene. In one aspect the invention provides a vector for the 
integration of a gene into the genetic material of a mammalian host cell 
such that the gene may be expressed in the host cell, the vector 
comprising a promoter and the said gene and a Locus Control Region 
10 capable of eliciting host cell-type restricted, integration site independent, 
copy number dependent expression of said gene, characterised in that the 
Locus Control Region is located within a region extending from 14kb 
upstream to 25 kb downstream of the CD68 gene. 

FMnctiongI <<gflnitipn pf an LQR 

15 Locus Control Regk>ns are fundamentally different from other 

gene regulatory sequences in that they are not subject to position effects 
after integration into host cell chromosomes. Work with the human p- 
globin gene LCR revealed the three criteria which distinguish this class of 
DNA sequence from other regulatory sequences such as promoters and 

20 enhancers (1.4,5). Grosveld, F„ e/a/ Cell, 1987, 51. 975-985., Blom van 
Assendelft. M„ etaL Cell, 1989. 56. 969-977., Talbot. D., etaL Nature, 
1989, 338, 352-355. 

1 ) LCRs direct position-independent expression of co-linked 
genes after integration into host genomes either in transgenic animals or in 

25 cell lines. As a consequence all transgenic animals carrying an intact copy 
of the transgene will express the transgene at a significant level. This is in 
sharp contrast to results obtained with other DNA sequences. 

2) LCRs exhibit strict tissue specificity in the pattem of 
transgene expression. A human p-globin gene placed downstream of a 

30 human p- globin LCR is expressed only in red cells of transgenic mice 
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(1, 4). The exact same human p-globin gene fragment placed downstream 
of a human CD2 LCR is expressed at high levels in all T-cells but not red 
cells of transgenic animals (2). Greaves, D.R.. et al. Cell. 1989. 56. 979- 
986. 

5 3) LCRs direct high-level, copy number-dependent gene 

expression. Human p-globin transgenes can be expressed as efficiently on 
a per copy basis as endogenous murine p-globin genes in the erythroid 
ceils of transgenic mice. When each transgene expresses at the same 
high level this leads to a direct relationship between transgene expression 

10 and transgene copy number. 

LCR's are thus expected to have a region providing chromatin opening 
activity, which region provides at least the first and third activities described 
above. An LCR generally contains at least one transcriptional regulatory 
sequence such as a promoter or enhancer in addition to the sequence 

15 which provides chromatin opening activity. 

The genetic diseases p- thalassaemia and sickle cell 
anaemia are caused by mutations within the human p- globin gene locus. 
All the red blood cells of the body are derived from a limited number of 
20 haematopoietic stem cells found in the bone manrow. The demonstration 
that the p-globin LCR was able to direct red cell-speciftc expression 
regardless of its site of integration into the genome lead us to propose the 
idea of somatic gene therapy for p- thalassaemia and sickle cell anaemia 
by introducing, p-globin LCR vectors into haematopoietic stem cells and 
25 using such transfected cells in bone marrow transplantation. 

This is described in one of the proposed applications set out 
in the original p-globin LCR patents filedJn 1987 and 1989 (UK patents 
8718779 & 8904009.1). 

The Locus Control Region (LCR) of the invention may be 
30 located within a region extending from 5.5 kb upstream to 12 kb 
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downstream of the CD68 gene; particularly from 2940 base pairs (bp) 
upstream of CD68 gene to 335 bp downstream (shown in Seq 10 No. 3) 
and more particularly within a 3 kb BstX1-BstX1 locus immediately 
upstream of the CD68 gene. The LCR may be a single continuous 

5 sequence or may consist of two or more such sequences linked together 
with or without intervening polynucleotides. The LCR may consist of, be 
derived from, or correspond to one or more DNAse hypersensitive sites. If 
the LCR of the naturally occuning gene locus comprises two or more 
discrete sub-sequences separated by intervening non-functional 

10 sequences (for example, two or more hypersensitive sites) the vector of the 
invention may comprise an LCR comprising two or more of the sul>- 
sequences linked together with all or part of the intervening sub-sequences 
removed. 

The term ''vector" as used herein connotes in its broadest 
15 sense any recombinant DNA material capable of transferring DNA from 
one cell to another. 

In another aspect the invention provides a mammalian host 
cell transformed with a vector as defined. The mammalian host cell is 
selected from macrophages, monocytes and dendritic cells and their 
20 precursors. 

In other aspects the invention provides: a method of 
producing a polypeptide by culturing a mammalian host cell as defined; a 
method of modifying mammalian host or stem cells by transfomiation with 
a vector as defined; transformed mammalian host or stem ceils for use in 
25 the treatment of a diseased condition in a human or animal body; and use 
of a vector as defined, or mammalian host or stem cells as defined, for the 
manufacture of a medicament for the treatment of a disease condition of 
the human or animal body caused by a gene deficiency. 

30 Figure 1- restriction maps of cosmids containing the human 
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CD68 gene. 

Figure 2 - DNA sequence of the 5' flanking regions of the 
human CD68 gene, (shown also in Seq ID No. 1) 

Figure 3 - Northern blot analysis of RNA expression in stably 
5 transfected RAW cells. 

Figure 4 - RT PCR analysis of CD68 and macrosialin RNAs 
in stably transfected RAW cells. 

Figure 5 - Northern blot analysis of RNA expression in stably 
transfected RAW and A20 cells 
10 Figure 6 - RT PCR analysis of CD68 and HPRT RNAs in 

stably transfected RAW and A20 cells. 

Detailed Description 

According to the present invention CD68 genes include 
15 human CD68. mammalian non-human CD68, for example mouse, dog, cat. 
rabbit, pig, cow, horse or rat CD68, or non-mammalian CD68, Mouse 
CD68 is known as macrosialin. Preferably the CD58 is human. 

The nucleotide sequences which regulate transcription of 
CD68 gene are contained within the COSMID CD68C1 which is obtainable 
20 from a commercially available human genomic DNA cosmid library 
(Stratagene). 

A transcriptional regulatory sequence (trs) generally 
comprises at least one promoter region and optionally at least one 
enhancer region. A trs may be a single continuous nucleotide sequence or 

25 may consist of two or more such sequences linked together with or without 
intervening polynucleotides. Trs regions are generally 5' (upstream) from 
the jcoding region but may also be fourid 3' (downstream) of the ATG start 
codon, for example in regions of coding sequence or in an tntron. 
Specifically regions of human CD68 trs have been identified in the first 

30 intron downstream of the ATG start codon. 



BNSCXKID: <WO_9742337Al_L> 



wo 97/42337 



PCT/GB97/01209 



It the trs of the naturally occurring gene locus comprises two 
or more discrete sub-sequences separated by intervening non-functional 
sequences (for example, two or more super hypersensitive sites) the vector 
5 of the invention may comprise a trs comprising two or more of the sub- 
sequences linked together with all or part of the intervening sub-sequences 
removed. 

The vector of the invention may be used to integrate into the genome an 
expression cassette which comprises a CD68 trs and a polynucleotide 

10 operatively linked thereto encoding a heterologous polypeptide. The 
vector may also be used to integrate a chimearic gene. 

In either case the open reading frame or coding sequence of 
the gene or cassette is expressed In the host cell. 

The vector comprises at least a trs and an open reading 

15 frame. An open reading frame may comprise introns. An open reading 
frame may comprise only coding regions. 

A chimearic gene comprises at least a trs and an open 
reading frame. Optionally a chimearic gene will further comprise 
polynucleotide regions which regulate replication, transcription or 

20 translation or any other process important to the expression of the 
polynucleotide in a host cell. 

The position of sequences relative to the CD68 gene is 
generally measured relative to the ATG start codon for upstream regions 
and relative to the stop codon for downstream regions. 

25 Isolated or cloned means separate "by the hand of man" from 

its natural state; /,e., that, if it occurs in nature, it has been changed or 
removed from its original environment, or both. For example, a naturally 
occurring polynucleotide or a polypeptide naturally present in a living 
organism in its natural state is not "isolated," but the same polynucleotide 

30 or polypeptide separate from the coexisting materials of its natural stat is 
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"isolated", as the term is employed herein. As part of or following isolation, 
such polynucleotides can be joined to other polynucleotides, such as 
DMAs, for mutagenesis, to form fusion proteins, and for propagation or 
expression in a host, for instance. The isolated polynucleotides, alone or 
5 joined to other polynucleotides such as vectors, can be introduced into 
host cells, in culture or in whole organisms. Introduced into host cells in 
culture or in whole organisms, such DNAs still would be isolated, as the 
term is used herein, because they would not be In their naturally occurring 
form or environment. Similarly, the polynucleotides and polypeptides may 

10 occur in a composition, such as media formulations, solutions for 

introduction of polynucleotides or polypeptides, for example, into cells, 
compositions or solutions for chemical or enzymatic reactions, for instance, 
which are not naturally occurring compositions, and, therein remain 
isolated polynucleotides or polypeptides within the meaning of that term as 

15 it is employed herein. 

Expression cassettes themselves are well known in the art of molecular 
biology. Such an expression cassette contains all essential DNA 
sequences required for expression of the heterologous enzyme in a 
20 mammalian cell. For example, a preferred expression cassette will contain 
a molecular chimaera containing a coding sequence an appropriate 
polyadenylation signal for a mammalian gene (i.e., a polyadenylation signal 
that will function in a mammalian cell), and enhancers and promoter 
sequences in the correct orientation. 

25 

NonTially, two DNA sequences are required for the complete and 
efficient transcriptional regulation of genes that encode messenger RNAs 
in mammalian cells: promoters and enhancers. Promoters are generally 
located immediately upstream (5') from the start site of transcription. 
30 Promoter sequences are required for accurate and efficient initiation of 
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transcription. Different gene-specific promoters reveal a common pattern 
of organisation. A typical promoter includes an AT-rich region called a 
TATA box (which is located approximately 30 base pairs 5' to the 
transcription initiation start site) and one or more upstream promoter 

5 elements (UPEs). The UPEs are a principle target for the interaction with 
sequence-specific nuclear transcriptional factors. The activity of promoter 
sequences is modulated by other sequences called enhancers. The 
enhancer sequence may be a great distance from the promoter in either an 
upstream (5*) or downstream (3') position. Hence, enhancers operate in an 

10 orientation- and position-independent manner. However, based on similar 
structural organisation and function that may be interchanged, the absolute 
distinction between promoters and enhancers is somewhat arbitrary. 
Enhancers increase the rate of transcription from the promoter sequence. 
It is predominantly the interaction between sequence-specific 

15 transcriptional factors mth the UPE and enhancer sequences that enable 
mammalian cells to achieve tissue-specific gene expression. The 
presence of these transcriptional protein factors (tissue-specific, 
trans-activating factors) bound to the UPE and enhancers (cis-acting, 
regulatory sequences) enables other components of the transcriptional 

20 machinery, including RNA polymerase, to initiate transcription with 
tissue-specific selectivity and accuracy. 

Identity, as known in the art, is the relationship between two or more 
polynucleotide sequences, as determined by comparing the sequences. In 

25 the art, identity also means the degree of sequence relatedness between 
polynucleotide sequences, as the case may be, as determined by the 
match between strings of such sequences. Identity can be readily 
calculated (C^mpufatfona/Mo/ecu/arB/o/ogy. Lesk, A.M.. ed., 
University Press, New York, 1988; Biocomputing: Informatics and Genome 

30 Projects, Smith. D.W.. ed.. Academic Press, New York, 1993; Computer 
Analysis of Sequence Data, Part I, Griffin, A.M.. and Griffin, H.G., eds.. 
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Humana Press, New Jersey, 1994; Sequence Analysis in Molecular 
Biology, von Heinje, G., Academic Press. 1987; and Sequence Analysis 
Primer, Gribskov, M. and Devereux. J., eds., M Stockton Press, New York. 
1 991). While there exist a number of methods to measure identity 
5 between two polynucleotide sequences, the term is well known to skilled 
artisans {Sequence Analysis in Molecular Biology, von Heinje, G., 
Academic Press, 1987; Sequence Analysis Primer, Gribskov, M. and 
Devereux. J., eds.. M Stockton Press, New York. 1991; and Carillo, H., and 
Lipman, D.. SIAM J. Applied Math,, 48: 1073 (1988). Methods commonly 

10 employed to determine identity between sequences include, but are not 
limited to those disclosed in Carillo, H., and Lipman, D., SIAM J. Applied 
Math., 48:1073 (1988). Prefen^ed methods to detemiine identity are 
designed to give the largest match between the sequences tested. 
Methods to detennine identity are codified in computer programs. 

15 Preferred computer program methods to determine identity between two 
sequences include, but are not limited to, GCG program package 
(Devereux. J., et al.. Nucleic Acids Research 12(1): 387 (1984)). BLASTP. 
BLASTN. and FASTA (Atschul, S.F. et al.. J. Molec. BioL 215: 403 (1990)). 
Polynucleotides which have 70% or more identity with the 

20 polynucleotides herein defined may differ from the defined sequences by 
virtue of at least one nucleotide substitution, addition, deletion, fusion or 
truncation in the polynucleotide. 

One of the regions in the CD68 gene which has been found 
to be critical for the macrophage specificity of the CD68 gene is the region 

25 which extends from the ATG start codon to the nucleotide approximately 
80 base pairs upstream (5') to the ATG start codon. This will be referred to 
as the -80bp region. 

A second region which has been found to be critical is the 
region downstream (3*) of the ATG start codon of the CD68 gene and 

30 which contains the first PU.1 site downstream of the start codon. A PU.1 
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site is a DNA sequence capable of binding the etf - family transcription 
factor PU,1 which is expressed in macrophages, neutrophills and B-cells. 

The first PU.1 site downstream of the start codon is 
comprised within an intron. In human CD68 the intron containing the first 
5 PU.1 site is from nucleotide 32 to nucleotide 137 downstream of the start 
codon. 

The region containing the first downstream PU.1 site and the 
-80bp region when operatlvely linked provide macrophage specificity. Seq. 
ID No 2 comprises the sequence of human CD68 from -BObp to the ATG 
10 start codon and comprises a PU.1 site and the first intron downstream of 
the ATG start codon. 

A further region which has been found to be critical in 
providing specificity is the region containing the PU.1 site approximately 
1 lObp upstream of the ATG start codon. Thus the nucleotide sequence 
15 from the ATG start codon to approximately 150 base pairs upstream from 
the ATG start codon confers enhanced macrophage specific expression. 

A further region which confers specificity Is at the PU.1 site 
which is found approximately 432 base pairs upstream from the ATG start 
codon. Thus the nucleotide squence which extends from the ATG start 
20 codon to approximately 460 base pairs upstream from the start codon 
confers further enhanced macrophage specific expression. 

High macrophage specificity is observed when a gene is 
expressed under the control of any of the above sequences. High 
macrophage specificity may also be obtained when a gene is expressed 
25 under the control of the region which extends from 2940 base pairs 
upstream of the ATG start codon to 335 base pairs downstream of the 
TGA stop codon. 

The first sequence of the first intron of macrosialin has also 
been determined and is also a polynucleotide of the invention. The whole 
30 sequence of the open reading frame of macrosialin is given in Seq. ID No 
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4. Oligonucleotide primers derived from the murine macrosialin cDNA 
sequence were used to PGR amplify a 1759bp fragment from Sv129 
mouse genomic DNA. The genomic PGR product was cloned into the 
InVttrogen plasmid vector pCRIi and the macrosialin gene sequence was 
5 detemiined by dye temiinator DNA sequencing of double stranded DNA. 
DNA coding sequence is shown in uppercase and underlined, 5' and 3' 
untranslated regions and intervening sequences are shown in lowercase. 
The ATG initiator and TGA termination codons are underlined. 

The present invention therefore provides a polynucleotide 

10 comprising 

(a) a polynucleotide fragment having at least 70% identity 
to a polynucleotide as defined herein; 

(b) a polynucleotide which is complementary to the 
polynucleotide of (a); or 

15 (c) a polynucleotide comprising at least 15 sequential 

bases of the polynucleotide of (a) or (b) 

Preferably the polynucleotide fragment has at least 80% 
identity to a polynucleotide as herein defined, more preferably at least 90% 
identity to the polynucleotide as herein defined. Most preferably the 
20 isolated polynucleotide according to the invention comprises the 
polynucleotide as herein defined. 

Specifically the present invention provides a 
polynucleotide comprising an isolated polynucleotide comprising: 

(a) a polynucleotide fragment having at least 70% identity 
25 to the polynucleotide of Seq ID No. 3; 

(b) a polynucleotide which is complementary to the 
polynucleotide of (a);, or 

(c) a polynucleotide comprising at least 15 sequential 
bases of the polynucleotide of (a) or (b). 

30 A preferred nucleotide according to the invention comprises 
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the region containing the first PU1 site and the -80bp region. 

A further preferred nucleotide comprises the region from 
+150 bp to -137 bp relative to the ATG start codon. 

A still further preferred nucleotide comprises the region from 
5 +460 bp to -1 37 bp relative to the ATG start codon. 

The nucleotide sequence of each of these regions may be 
determined from Seq. ID No 3 which shows the sequence of cosmid 
CD68C1 from 2951 bp 5' of the CD68 ATG initiation codon (shown in bold 
type) to 2090 bp 3' of the CD68 TGA termination codon (shown in bold 
10 type). Intron sequences are in lowercase, and CD68 coding and 3' 
untranslated regions are underlined. 

Each of the nucleotide sequences of the invention may be 
used in inverted form. 

More than one copy, for example two. three or four copies, of 
15 each of the regions described above may be used to control expression of 
a single gene. When more than one copy of a region is used, the copies 
are preferably operabiy linked. The regions may be used in combination or 
separately. 

The sequence ID No. 2 may be used as a probe to locate the 
20 sequence ID No. 3, which may then be manipulated according to 
conventional techniques using known restriction sites. 

In a search for macrophage-specific gene regulatory 
sequences recombinant cosmids were isolated containing the human 
CD68 gene by PGR screening of a human genomic DNA library in the 
25 cosmid vector pWE15. Human CD68 is the human homologue of mouse 
macrostalin, a protein which is found in the phagosomal compartment of all 
monocytes and macrophages. Two types of human CD68 cosmids were 
obtained which contain the complete CD68 gene with 14 kilobases of 5' 
flanking sequences and 25 kilobases of 3' flanking sequences (Figure 1). 
30 The cosmids were used as probes to demonstrate that the 
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human C068 gene is located on the short arm of human chromosome 1 7. 
The cosmids were used as templates to determine the DNA sequence of 
the CD68 gene and 1870bp of the CD68 promoter (Figure 2). 

The CD68 cosmids were transfected into the murine 
s macrophage cell line RA\N26A.7 and stably transfected RAW cell 

populations and polyclones were selected by growth in G418. Northern 
blot analysis of total RNA isolated from G418 resistant RAW cells was 
perfomned using a radiolabelled human CD68 gene probe (Figure 3A) and 
a mouse lysozyme probe to control for RNA loading and transfer 

10 (Figure 3B). Total RNA prepared from human peripheral blood 

mononuclear cell (PBMC) cultures was analysed on the same filter. The 
human CD68 cosmid transfected RAW cells express very high levels of 
human CD68 mRNA of the expected size. The levels of hunian CD68 
mRNA are higher in cosmid transfected RAW cells than in cultured human 

15 monocytes. This result was confirmed by analysis of transfected RAW cell 
RNA by Reverse Transcription - Polymerase Chain Reaction (RT-PCR) 
analysis using human CD68 and murine macrosiaiin specific PGR primers 
(Figure 4). The transfected human CD68 gene is expressed at higher 
levels than the endogenous mouse macrosiaiin gene. 

20 The same CD68 cosmid DNAs were transfected into murine 

A20 B- cells an d 3T3 fibroblasts. Northern blot analysis of CD68 RNA 
expression showed levels of CD68 mRNA at least 50 times lower than that 
observed in murine macrophage cell line RAW264.7 transfected with the 
same cosmids (Figure 5). These results were confirmed by RT-PCR 

25 analysis of transfected cell RNA using CD68 and HPRT specific PGR 
primers (Figure 6). 

Single celt clones from G418 resistant RAW cell populations 
have been isolated and will be analysed to determine if there is a direct 
relationship betwe n transgene copy number and transgene xpressionin 

30 transfected RAW264.7 cells which would be predicted if there is a 
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macrophage-specific Locus Control Region In the CD68 cosmids. 

The extremely high level of human CD68 mRNA seen in 
cosmid transfected RAW cells (Figures 3.4 and 5) show that important 
macrophage-specific genetic regulatory elements are contained within the 
5 CD68 cosmids cosCD68C1 and cosCD68G1. Our previous work using the 
human lysozyme promoter in transgenic animal experiments has shown an 
Important role for intervening sequences and poly A+ addition sequences 
for transgene expression in macrophages (7,8) Dighe, A.S.. et ai 
Immunity. 1995, 3, 657-666. and Clarke. S., etal Proc.Natl. Acad. Sci. 
10 (USA). 1996. 93; 1434-1438.. A role for intervening sequences in 

ensuring efficient transgene expression has been described in a number of 
other systems. 

Development of a macrophaae-SDecific aene expression vector 

15 in order to express heterologous genes at a high level in 

monocytes and macrophages it would be desirable to design a C068 
expression vector. Such a vector should Include all the DNA sequences in 
the human CD68 cosmids which direct high-level, macrophage-specific 
gene expression along with the CD68 introns and poly A+ sequences and 

20 unique restriction enzyme recognition sites for the insertion of cDNAs 
encoding heterologous genes of interest. 

To facilitate manipulation of the human CD68 gene 
sequences present in cosmid cosCD68C1 a 20kb EcoRi fragment which 
contains the complete human CD68 gene along with 5.5kb of 5' flanking 

25 and 12.5kb of 3' flanking sequences was cloned into the unique EcoRI site 
of the plasmid vector Bluescript SK- (Stratagene) to give the recombinant 
plasmid pCD68R1A (Figurel). Two further recombinant plasmids were 
derived from pCD68R1 A which contain the 5' flanking and 3' flanking 
sequences of the human CD68 gene, pCD68RS1 and pCD68SR1 (Figure 

30 1). In order to engineer the insertion of cDNAs encoding heterologous 
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genes immediately downstream of the CD68 gene promoter and 
immediately upstream of the CD68 gene*s ATG initiation codon the plasmid 
pCD68RS1 was digested with the restriction enzyme BstX1 and a 3kb 
BstX1 fragment was cloned into the plasmid vector Bluescript 
5 SK(Stratagene) after treatment with T4 DNA polymerase which removes 
the CD68 ATG initiation codon (Figure 2). The human CD68 genomic DNA 
fragments present in the recombinant plasmids shown in Figure 1 allow for 
the development of a versatile and easily manipulated human CD68 
expression system for use in macrophage cell lines, transgenic animals 
!0 and human primary ceils. 

Table 1 Promoter activity of CD68 promoter 5' d eletion series in 

trangjently trangfgcteci RAW2g4.7 and P398.P1 cells. 

15 RAW264.7 or P388.D1 cells were electroporated in the presence of 
20\iQ of the indicated CAT reporter plasmids and Sjig of the p- 
galactosidase reporter plasmid pcDNA3 p-gal. Forty eight hours post 
transfection cell iysates were assayed for p-galactosidase and CAT 
enzyme activity as described in Experimental Procedures. Cell lysate 

20 CAT enzyme activities were corrected for transfection efficiency and 
the data is expressed as a percentage of the CAT enzyme activity 
obtained with the SV40 promoter/enhancer plasmid pCATControl in the 
same experiment. All cell lysate CAT enzyme activities were within the 
linear range of the assay as determined from a CAT enzyme dilution 

25 series analysed in the same experiment. The data shown are from a 
single transfection experiment and the relative promoter activities were 
reproducible in at least three independent transfection experiments with 
each construct in both cell lines. 

30 Table 2 Com parison of myeloid cene promoter activities in 

transiently transfected nmrine macrophage cgH lines. 
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P388.D1 or RAW264.7 cells were electroporated in the presence of 
20|xg of the indicated CAT reporter plasmids and Spg of the p- 
galactosidase reporter plasmid pcDNA3 p-gal. Twenty four hours post 

5 transfection cell lysates were assayed for p-galactosidase and CAT 
enzyme activity as described in Experimental Procedures. Cell lysate 
CAT enzyme activities were corrected for transfection efficiency and 
expressed as a percentage of the CAT enzyme activity obtained with the 
SV40 promoter/enhancer plasmid pCATControl in the same experiment. 

10 All cell lysate CAT enzyme activities were within the linear range of the 
assay as determined from a CAT enzyme dilution series analysed in the 
same experiment. The data shown are from a single transfection 
experiment and the promoter activities were reproducible in at least two 
independent transfection experiments with each construct. 

15 

T9ble3 



Plasmid 

-2940CD68pCAT 

hLZMpCAT 

mLZMpCAT 

CDIIbpCAT 

c-fes pCAT 

hMSRpCAT 



Gene 

hCD68 

hLysozyme 

mLysozyme 

hCD11b 

h c-fes 

hMSR 



Promoter 

-2940 to +2* 
-517 to +26 
-487 to +11 
-1706 to +91 
-446 to +71 
-487 to +1 1 



Accession 
No 

This paper 

X57103 

D13263 

M76724 

X06292 

D13263 



20 Table 3 shows Myeloid promoters used in this study 

All myeloid gene promoter fragments were cloned into the multiple cloning 
site of the CAT reporter vector pCATBasic (See Experimental Procedures). 
Human genes are denoted by the prefix h and murine genes by the prefix 
25 m. 

Promoter coordinates shown are taken from the given Genbank accession 
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numbers and the major transcription initiation site is denoted as +1 except 
for 

CD68 where the A of the ATG translation initiation codon is denoted as +1 

n 

5 

Table 5 The effect of the CD68 first intron in transiently 
transfected RA\AP64.7 and CHQ cell lines. 

10 RAW264.7 cells were electroporated in the presence of 20(xg of the 
indicated CAT reporter plasmids and Sfxg of the p-galactosidase 
reporter plasmid pcDNA3 p-gal. Forty eight hours post transfection 
transfected cell lysates were assayed for p*galactosldase and CAT 
enzyme activity. CHO cells were transfected with 4.5\xq of the indicated 

15 CAT reporter plasmids and O.Sfig of the p-galactosidase reporter 
plasmid pcDNA3 p-gal compiexed with 50^g of the cationic lipid 
Lipofectamine (Gibco BRL). Cell lysate CAT enzyme activities were 
corrected for transfection efficiency and expressed as a percentage of 
the CAT enzyme activity obtained with the SV40 promoter/enhancer 

20 plasmid pCATControl analysed in the same experiment. All cell lysate 
CAT enzyme activities were within the linear range of the assay as 
determined from a CAT enzyme dilution series analysed in the same 
experiment. The data shown are from a single transfection experiment 
and the promoter activities were reproducible in at least three 

25 independent transfection experiments with each construct in each cell 
line. 

Table 4 The effect of the CD68 fir st intron on CD68 and HIV oromoters 
30 RAW264.7 and P388.D1 cells were electroporated in the presence of 
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20ng of the indicated CAT reporter piasmids and 5^g of the p- 
galactosidase reporter plasmid pcDNA3 p-gal. Forty eight hours post 
transfection transfected cell lysates were assayed for p-galactosidase and 
CAT enzyme activity. Cell lysate CAT enzyme activities were con-ected 

5 for transfection efficiency and expressed as a percentage of the CAT 
enzyme activity obtained with the SV40 promoter/enhancer plasmid 
pCATControl analysed in the same experiment. All cell lysate CAT 
enzyme activities were within the linear range of the assay as 
detemiined from a CAT enzyme dilution series analysed in the same 

10 experiment. The data shown are from a single transfection experiment 
and the promoter activities were reproducible in at least three 
independent transfection experiments with each construct in each cell 
line. 

15 

ME TH OP g 

Construction of promoter reporter piasmids 

A 2940bp BstXI fragment was purified from an EcoRI - Spel fragment 
20 subcloned from cosCD68C1 . The BstXI fragment was rendered blunt 
ended by incubation with T4 DNA polymerase and all four dNTPs and 
cloned into the EcoRV site of pBluescipt SK- (Stratagene) to give plasmid 
pCD68Bst3"2. The 3' BstXI site contains the CD68 ATG initiation codon 
which was removed by the 3" exonuclease activity of T4 DNA polymerase. 
25 A Hind 111 - Xbal fragment containing 2940 bp of DNA upstream of the 
CD68 ATG codon was cloned into the reporter vector pCAT Basic 
(Promega, Genbank accession number X65322) to give the plasmid -2940 
GD68pCAT. Plasmid -2940<GD68pCAT was digested with Xhol, Bglll or 
SstI and overhanging ends filled in by treatment with the Kienow fragment 
30 of DNA polymerase or T4 DNA polymerase before ligation of 

phosphorylated Hindlll linkers. Following digestion with Hindlll and Xbal, 
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5' truncated CD68 promoter fragments were gel purifted and subcloned 
into Hindlll and Xbal digested pCATBasIc to give the plasmids - 
2258CD68pCAT. -1576CD68pCAT and -•951CD68pCAT. All other CD68 
promoter deletions were prepared by PCR using plasmid -2940 
5 CD68pCAT as a template and 5' oligonucleotide primers which added a 
Hindlll site and a common 3* PCR primer which spanned the Xbal cloning 
site of plasmid -2940 CD68pCAT (listed In Table 1). Amplified fragments 
were digested with Hindlll and Xbal, gel purifed and subcloned into Hindlll 
and Xbal digested pCATBasic to give the plasmids -333CD68pCAT. - 
10 232CD68pCAT. -150CD68pCAT and -80CD68pCAT. The first intron of the 
CD68 gene was PCR amplified using primers 5* 
ccggaattcTGCTGGGGCTACTGGCAG and 5* 

tgatctagaGTCCCCTGGGCTTTTGGCAG which added EcoRI and Xbal 
sites (underlined). Following EcoRI and Xbal digestion the CD68 intron 

15 fragment was cloned into pCD68Bst3-2 digested with EcoRI and Xbal to 
give plasmid pCD68BstlVS and a 3022bp Hindlll - Xbal fragment was 
cloned into pCATBasic to give the plasmid -2940 IVSpCAT. The HIV 
minimal LTR construct HIV pCAT is from Lew et aL (1991) Mol. Cell. Biol. 
11, 182-191. Construct HIV IVS pCAT was made by ligating the EcoRI to 

20 Xba I IVS I fragment of pCDBSBstIVS into the unique Bglli site in the HIV 
tar sequence. All CD68 promoter reporter constructs were sequenced 
using M13 reverse and CAT primers and shown to exactly match the CD68 
promoter sequence shown in Figure.A 1.7 kb Hindlll-BamHI (blunt) 
fragment containing CD11b sequences from -1706 to +91 was excised 

25 frorti the plasmid pB202 (Dzjemmis et al, 1995. Blood, 85. 319-329 and 
cloned between the Hindlll and Xbal (blunt) sites of pCATBasic to give the 
clone CD1 lb pCAT. A 516bp Hind III - Xba I fragment containing c-fes 
sequences from -446 to +71 was excised from the plasmid p446 (a kind gift 
of Celeste Simon. H ydemann et al.. 1996) and cloned between the Hind 

30 III and Xba I sites of pCATBasic to give the plasmid c-fes pCAT. Other 
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myeloid gene promoters were cloned by PGR using the primers designed 
from published sequences listed in Table I with cloned DMA or human 
genomic DNA templates. A Sma-BamHI fragment containing the SV40 
eariy splice and polyadenylation sequences from plasmid pMSG 

5 (Pharmacia) was ligated into Smal-BamHI digested plasmid pH2KBS 
which contains the 1.6kb H2K cDNA cloned into the EcoRV site of 
pBluescript (a kind gift of Dr D. Mokophidis) to give the plasmid pH2KSV. 
A 2.5kb Hindlll-BamHI fragment of pH2KSV was rendered blunt ended by 
treatment with T4 DNA polymerase and ligated into Smal digested CD68 

10 promoter plasmid pCD68Bst3-2 to give plasmid pCD68-H2KSV and the 
same pH2KSV fragment was ligated into Hindi digested human lysozyme 
promoter plasmid pBH7.4 (Clarke et al. 1996) to give hL2M-H2KSV. 
Plasmid pkb-Hindlll contains a 7.4kb Hindlll genomic DNA fragment of the 
H2Kb gene (Weiss et al., 1992). 

15 

Supercoiled plasmid DNAs were prepared from 500ml cultures of E.coli by 
NaOH/SDS lysis and purified by equilibrium centrifugation in CsCI/ethidium 
bromide gradients followed by phenol/chlorofomi extraction and ethanol 
precipitation (Sambrook, Fritsch & Maniatis 1989). 

20 

Mammalian cell culture and transient transfection 

The murine macrophage cell lines RAW264,7 and P388.D1 were 
maintained in RPM1 1640 medium (Gibco BRL) supplemented with 10% 

25 heat inactivated foetal calf serum (PCS) (Sigma), 100 units ml-1 penicillin, 
100 ^xg ml-1 streptomycin, 2mM glutamine and 10mM Hepes (pH 7,0). 
CHO K1 cells were maintained in Ham's F-12 medium (Gibco BRL) 
supplemented with 10% PCS, antibiotics and glutamine. All cells were 
grown at 37oC in a humidifed Incubator in 5% C02/air. RAW264.7 and 

30 P388.D1 cells were grown to confluence in T175 flasks, han^ested in 
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Phosphate Buffered Saiine (PBS) and washed once and resuspended in 
Optimem 1 serum free medium (Gibco BRL) for RAW cells or RPMI 1640 
(no FCS) for P388D1 cells . Cells were counted and adjusted to a final 
density of 4 x 107 cells mM . Aliquots of 2 x 107cells (0.5ml) were mixed 
5 with 50 fxg CAT reporter plasmid DNA and 5 pcDNA3 b-galactosidase 
piasmid DNA, added to a 0.4cm electrode gap eiectroporation cuvette 
(BioRad) and shocked in a BioRad GenePulser (300V. 960 fxFD) at room 
temperature. Cells were recovered immediately into lOmi of ceil growth 
medium which had been pre-warmed to 37oC and plated into 35mm and 

10 9cm diameter tissue culture petri dishes (Nunc). Cells were analysed 24 or 
48 hours post eiectroporation for transfection efficiency by staining fixed 
permeabilised cells with 5-broma4-diloro-3-indoiyl-p-D<-gaiactopyranoside 
(X-gal, Sigma) as described (Hogan et al. 1994). Transient transfection 
efficiencies of between 20 and 30% were routinely obtained with P388.D1 

15 murine macrophages and after optimisation transient transfection 

efficiencies in excess of 40% could be obtained with RAW264.7 cells. CHO 
cells were grown to 70-80% confluence in 9cm petri dishes, washed twice 
with Optimem before addition of 5ml of plasmid DNA:cationic lipid complex 
(5 ng DNA:50 |ig Lipofectamine (Gibco BRL) in Optimem). After 6-16 

20 hours incubation the medium was aspirated, cells were washed twice with 
PBS and recovered Into complete medium for 24-36 hours before analysis. 
X-gal staining and FACS analysis routinely showed CHO cell transient 
transfection efficiencies in excess of 40%. 

25 Reporter gene assays 

Transfected cells were harvested by scraping in PBS, 
washed once with PBS and cell pellets were resuspended in lOO^il 0.25M 
Tris-HCI (pH 7.8) and subjected to three rounds of freeze thaw lysis. Cell 
lysates were assayed for p-galactosidase enzyme activity using the 

30 colorimetric substrate chlorphenolred p-D-galactopyranoside (CPRG, 
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Boehringer Mannheim) in a 96 well plate assay in 50mM potassium 
phosphate buffer (pH7.3) with 2mM MgCiZ. Enzyme activity was 
detennined by spectrophotometry at 570nm after 30 minutes incubation at 
37oC using dilutions of purified E.coli p-galactosidase enzyme (Sigma) to 
5 generate a standard curve. CAT enzyme activity of cell lysates was 
determined after heat treatment (65oC, 20 minutes) using 10 ^Ci 14C 
labeled chloramphenicol (54Ci mmoU, Amersham) as substrate in a 125^x1 
reaction in 0.25M Tris-HCI (pH 8.0) containing 0.2 mg ml-1 n-butyryl CoA 
as cofactor. CAT enzyme activity was measured by determining the 
10 amount of butyryl-14C chloramphenicol extracted into mixed xylenes 
(Aldrich) after a 2 hour incubation at 37oC (Seed ref). A CAT enzyme 
standard curve was generated using dilutions of purified CAT enzyme 
(Promega) in each experiment and all CAT enzyme assays using 
transfected cell extracts were within the linear range of the enzyme assay. 

15 

Figure 1 . Restriction maps of cosmlds containing the human CD68 gene. 

Recombinant cosmids containing the human CD68 gene in 
the vector pWE15 were isolated by PCR screening of pools of robotically 
picked Ampicillin resistant HB101 colonies using PCR primers CD68 LI & 

20 CD68 L2. CD68 PCR positive single colonies were used to prepare 
cosmid DNA which was analysed by restriction mapping and Southem 
blotting using radioactively labelled CD68 gene probes. Sites for the 
restriction enzymes Spe I (Spe). Cla I and Not I are shown. Not I sites in 
brackets (Not I) are derived from the pWE15 cosmid vector cloning site and 

25 flank the human genomic DNA insert. 

Below the map of cosmid cosCD68C1 is shown the position 
of a 20kb EcoRI fragment cloned into the plasmid vector pBluescriptSK- to 
give the recombinant plasmid pCD68R1 A. The positions of other 
COSCD68C1 restriction fragments cloned into the plasmid vector 

30 pBluescriptSK- are also indicated. The 3kb BstXi fragment the 3' end of 
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which contains the CD68 gene ATG initiation codon is indicated. 

Figwre DNA sequence of the 5' flanking region of the human CD68 
gene. 

5 The DNA sequence of the 5* flanking region of the CD68 

gene was determined by double strand sequencing using the CD68 
cosmids as template and oligonucleotide primers derived from the 
published human CD68 cDNA sequence (17).The initiator Methionine 
encoding ATG codon of the CD68 gene which is contained within a BstXI 

10 restriction enzyme recognition site is boxed. CD68 coding regions are 
underlined and intervening sequences inferred from comparison with the 
published CD68 cDNA sequence are delimited by vertical arrows. The 
sequence has not yet been confirmed on both strands, ambiguities in 
multiple sequence reads are indicated by lowercase letters according to 

15 the lUPAC-lUB standards described in Nuc. Acids. Res. 13, 3021-3030 
(1985). Dots in the sequence indicate a chemical bond. 

Figure 3. Northern blot analysis of RNA expression in stably transfected 
RAW cells. 

20 Total RNA was prepared from G41 8 resistant RAW cell 

populations (derived from at least 10,000 independent G418 resistant 
colonies) or polyclones (derived from 50 -100 independent G418 resistant 
colonies). RAW cells were transfected with 10 jig of supercoiled or Pvu I 
linearised CD68 cosmids or a pcDNA3 CAT plasmid which confers 

25 resistance to G41 8. 

Total RNA (10 jig) was denatured with fonnaldehyde. 
subjected to agarose gel electrophoresis, transferred to nylon membranes^ . . 
and hybridised with radioactively labelled human CD68 and mouse 
lysozyme probes. For comparison total RNA (5 |ig) prepared from human 

30 THP1 cells tr ated with PMA for 24 hours and human PBMC cultures was 
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analysed. A 6 hour autoradiographic exposure is shown. 

Figure 4 . RT PGR analysis of CD68 and macrosialin RNAs in stably 
transfected RAW cells. 
5 RNAs (10 |xg) prepared from the G418 resistant RAW cell 

populations analysed by Northern blotting in Figure 3 were used to prepare 
oligo dT-primed cDNA in a reverse transcription (RT) reaction in a final 
volume of 100 nl Control RT reactions omitting reverse transcriptase were 
performed using the same RNA samples (-RT). A 30 cycle Polymerase 
10 Chain Reaction (PGR) using macrosialin and CD68- specific primers was 
performed using 1 ^1 of the neat RT reaction or indicated dilutions as 
template. The RT PCR products were analysed by agarose gel 
electrophoresis and the position of mouse macrosialin and human CD68 
PCR products of the predicted sizes are indicated. 

15 

Figure 5 . Northern blot analysis of RNA expression in stably transfected 
RAW and A20 cells. 

RAW and A20 B-cells were transfected with 10 ^g of 
supercoiied or Pvu I linearised CD68 cosmids or a pcDNA3 CAT plasmid 

20 which confers resistance to G418. Total RNA (10 jxg) prepared from G418 
resistant cell populations was denatured with formaldehyde, subjected to 
agarose gel electrophoresis, transferred to nylon membranes and 
hybridised with a radioactively labelled human CD68 probe. For 
comparison total RNA prepared from human THP1 cells treated with PMA 

25 for 24 hours (5 ^g) and human PBMC cultures (2 iig) was analysed. A 6 
hour autoradiographic exposure is shown. 

Figure 6 . RT PCR analysis of CD68 and HPRT RNAs in stably 
transfected RAW and A20 cells. 
30 RNAs (10 ^g) prepared from the G418 resistant RAW cell 
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populations analysed by Northern blotting in Figures 3 & 5 were used to 
prepare oligo dT-primed cDNA in a reverse transcription (RT) reaction in a 
final volume of 100 A 30 cycle Polymerase Chain Reaction (PGR) 
CD68- and HPRT- specific primers was performed using 1R1 of the neat 
5 RT reaction or indicated dilutions as template. The RT PCR products were 
analysed by agarose gel electrophoresis and the position of human CD68 
PCR and HPRT PCR products of the predicted sizes are indicated. 

Table 1 

10 

This table shows the effect of deleting CD68 5' promoter sequences in 
RAW and P388.D1 transfections 



-2940pCAT 


38.6 


78.4 


-666pCAT 


37.5 


74.4 


-575pCAT 


48.4 


129.5 


-460pCAT 


71.6 


132.2 


-333pCCAT 


30.5 


112.3 


-232pCAT 


34.4 


96.4 


-150pCAT 


11.6 


176 


-SOpCAT 


0.64 


9.5 


pCATBasic 


0 


2 


pCATContro! 


100 


100 


Plasmid construct 


RAW264.7 


P388.D1 



Table 2 

15 

This table compares the level of CAT reporter gene activity of plasmids 
with different myeloid gene promoters in P388.D1 and RAW264.7 
pCAT Control 100 100 
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pCAT Basic 


0 


8.4 


-2940 CDooIVS 


I If 


163.9 


-2940 CD68 pCAT 


39 


50 


hLZM pCAT 


17.6 


66 


mLZM pCAT 


20.1 


73.3 


CDUbpCAT 


42 


23 


-c-fes pCAT 


4.8 


47.3 


hMSR pCAT 


3 


35.5 



CAT Reporter RAW264.7 P388.D1 

Plasmid 

Table 4 

This table compares the level of CAT reporter gene activity of plasmids 
with CD68 and HIV minimal LTR promoters in P388.D1 and RAVyG64.7 
5 cells 



pCAT Control 100 100 

pCAT Basic 0 8.4 

-2940CD681VS 117 163.9 

-2940 CD68pCAT 39 50 

HIV IVS pCAT 0.8 15 

HIVpCAT 0.8 69 

CAT Reporter RAW264.7 P388D1 
Plasmid 



Table 5 

This table shows the effect of adding the CD68 IVS infron on to diff r nt 
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CD68 promoter fragments in RAW and CHO cells 



pCAT Control 


100 


100 


pCAT Basic 


0 


0 


-2940 CD68pCAT 


31 


56 


-2940 IVS 


154 


63 


-80 pCAT 


0.6 


5 


-80 IVS 


53 


1.9 



Plasmid construct RAW264.7 CHO 
s Potential Ap plications for the CD68 LCR 

Th9 Macrophage as 9 dqliverY vghictg for gen? therapy 

Macrophages have several important advantages over other 
cell types for delivering therapeutic gene products in a range of important 

10 human diseases 

Macrophages have a high biosynthetic capacity. 
Macrophages secrete physiologically significant amounts of cytokines, 
growth factors, inflammatory mediators, proteases, protease inhibitors and 
other important biologically active macromolecules. 

15 Macrophages have a limited life span. 

After commitment tissue resident and recruited macrophages undergo only 
one or two cell divisions at most, this would be a distinct advantage in 
many human gene therapy protocols. 

Macrophages, are found in virtually all tissues. 

20 The presence of macrophages in virtually every tissue in the body in 
significant numbers increases the utility of an LCR which directs 
macrophage-specific gene expression. The pr sence of macrophages in 
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the lung and gut allows for recombinant DNA delivery to macrophages by a 
number of different routes. 

Macrophages are rapidly recruited into sites of inflammation. 
The ability to direct heterologous gene expression in a cell type which is 
5 recruited to sites of inflammation offers unique avenue for therapeutic 
Intervention in chronic Inflammatory disease. 
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Somatic Gene Therapy 

There are currently no human monogenic disorders described which 
specifically affect macrophage function which would be candidates for 
"classical" somatic gene therapy. However the CD68 LCR could be used 
5 to drive the expression of therapeutic gene products in human 

macrophages in a whole range of important human diseases. Candidate 
genes would include: Local delivery of soluble IL-IRa to inflammed rat 
knee joints by recombinant retroviral vectors has been shown to be 10 000 
times more efficient than systemic administration of slL-1 RA in reducing 

10 experimentally-induced arthritis (9) Makarov, S,S.. et al Proc, Natl. Acad. 
Sci. (USA), 1996. 93; 402-406. 

Currently there is much interest in anti TNF-a therapy in 
treatment of toxic shock syndrome and a number of autoimmune diseases 
such as rheumatoid arthritis which afflicts 1% of the UK population. 

15 Many important human diseases are the result of a failure in regulation of 
the immune system. Macrophages and cytokines secreted by 
macrophages such as IL-12 and IL-10 play a key role in regulating 
T-lymphocyte function. Macrophages-specific expression of trans- 
dominant negative IL-4 receptors could find application in autoimmune 

20 diseases such as Ulcerative Colitis and Crohn's Disease. 

Apo E and several other gene products have been proposed 
to play a key role in the development of atherosclerotic plaques which are 
the cause of blood vessel occlusion in atherosclerosis and strokes, Apo E 
expressed in macrophages present in atherosclerotic plaques might find 

25 application in the treatment of vascular occlusion (1 0). CD68 is present in 
the macrophage derived foam cells found in human disease tissue and we 
have shown the mouse homologue of CD68, macrpsialin, to be present at 
high levels in the atherosclerotic plaques of an apo E'^* mouse model of 
atherosclerosis. 

30 The murine homologue of CD68, macrosialin has been 
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reported to bind oxidised LDL (11). Overexpression of human CD68, 
murine macrostaim or human Macrophage Scavenger Receptor in 
macrophages present in atherosclerotic plaques may usefully reduce the 
amount of atherogenic oxidised LDL in atherosclerotic lesions. 
5 Chronic Granulomatous Disease (CGD) is a genetic disease 

caused by mutation of the NAPDH oxidase gene. CGD patients fail to 
make reactive oxygen metabolites in their phagocytic cells which kill 
bacteria and hence patients suffer from recurrent bacterial infections. 
Expression of NAPDH oxidase in macrophages of CGD patients by 

10 somatic gene therapy would be highly beneficial (12). 

There are several relatively rare human genetic diseases 
which can be treated by providing purified proteins which can be taken up 
by defective cells to correct their genetic defect. Examples include Beta 
cerebrosidase in patients with Gaucher's disease and other genes 

15 defective in lysosomal storage disorders (13). A particular attraction of the 
CD68 LCR for treatment of lysosomal storage diseases is the fact that 
CD68 is expressed in microglia, mononuclear phagocytic ceils resident in 
the brain. Some microglial cells in adults are derived from recruited blood 
monocytes and hence the CD68 LCR may offer the possibility of 

20 expressing therapeutic gene products in the brain via recruited blood 
monocytes transfected with CD68 LCR vectors. 

Treatment of infectious diseases 

Several important human pathogens survive and replicate in 

25 the endosomai compartment of macrophages. These include the 

causative agents of tuberculosis, ieismaniasis and leprosy. The HIV virus 
survives and replicates in monocytes. Targeting y-interferon production to 
macrophages infected with Mycobacterium bovis could be a viable strategy 
for treatment of Tb. The delivery of HIV decoy tar s qu noes and other 

30 anti HIV reagents to monocytes and macrophages would create a pool of 
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monocytes resistant to HIV infection. Similar gene therapy protocols for 
the "intracellular vaccination" of T-lymphocytes have been proposed. 

MaQrophggg Expreggipp Systems 

5 The original p- giobin LCR vectors have been used to 

develop expression systems for the over production of heterologous gene 
products in cultured erythroid cell lines(14). The CD68 LCR could be used 
for over expression of heterologous genes in human and murine 
macrophage cell lines for instance in the production of cytokines and 

to soluble receptors whose pattern of giycosylation was important for their 
biological activity. 

Qgnetjc Vagcinatipn 

Recently it was shown that naked DNA could be used in 

15 animals to confer protective immunity against lethal virus challenge and to 
elicit significant humoral antibody responses, Cun-ent protocols for genetic 
vaccination use standard mammalian expression vectors based on the 
SV40 or HCMV promoters and enhancers. Heterologous genes are 
expressed in myofibres but the cells presenting foreign antigen for the 

20 elaboration of an effective immune response are unknown (15). Using a 
CD68 LCR vector to target heterologous gene expression to macrophages 
and possibly dendritic cells should increase the efficiency and efficacy of 
genetic vaccination protocols. 

25 IfnnmnQthgrapy 

Dendritic cells process antigens for presentation to cells of 
the immune.system. Human dendritic, cells expressing the CD68 antigen ^ , 
are important in conferring tolerance in organ transplant rejection and 
autoimmune diseases. A CD68 LCR will allow for the genetic targeting of 

30 an important subset of Dendritic cells grown in culture from bone marrow or 
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peripheral blood precursors. 
Compositions 

The invention also relates to compositions comprising the 
poiynucleotide, vector or transfected cell. Thus, the polypeptides of the 
5 present invention may l>e employed in combination with a non*steriie or 
sterile carrier or carriers for use with cells, tissues or organisms, such as a 
pharmaceutical carrier suitable for administration to a subject. Such 
compositions comprise, for instance, a media additive or a therapeutically 
effective amount of a polypeptide of the invention and a pharmaceutically 
10 acceptable carrier or excipient. Such carriers may include, but are not limited 
to. saline, buffered saline, dextrose, water, glycerol, ethanol and 
combinations thereof. The formulation should suit the mode of 
administration. 

Kits 

15 The invention further relates to diagnostic and pharmaceutical packs 

and kits comprising one or more containers filled with one or more of the 
ingredients of the aforementioned compositions of the invention. Associated 
with such container{s) can be a notice in the fonm prescribed by a 
governmental agency regulating the manufacture, use or sale of 

20 phamiaceuticals or biological products, reflecting approval by the agency of 
the manufacture, use or sale of the product for human administration. 
Administration 

Polynucleotides and other compounds of the present invention may 
be employed alone or in conjunction with other compounds, such as 
25 therapeutic compounds. 

The pharmaceutical compositions may be administered in any 
effective, convenient manner including, for instance, administration by 
topical, oral, anal, vaginal, intravenous, intraperitoneal, intramuscular, 
subcutaneous, intranasal or intrademnal routes among others. 
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The pharmaceutical compositions generally are administered in an 
amount effective for treatment or prophylaxis of a specific indication or 
indications. In general, the compositions are administered in an amount of 
at least about 10 pg/kg body weight, in most cases they will be administered 
in an amount not in excess of about 8 mg/kg body weight per day. 
Preferably, in most cases, dose is from about 10 pg/kg to about 1 mg/kg 
body weight, daily. It will be appreciated that optimum dosage will be 
determined by standard methods for each treatment modality and indication, 
taking into account the indication, its severity, route of administration, 
complicating conditions and the like. 

In therapy or as a prophylactic, the active agent may be 
administered to an individual as an injectable composition, for example as 
a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical 
application for example in the form of ointments, creams, lotions, eye 
ointments, eye drops, ear drops, mouthwash, impregnated dressings and 
sutures and aerosols, and may contain appropriate conventional additives, 
including, for example, preservatives, solvents to assist drug penetration, 
and emollients in ointments and creams. Such topical formulations may 
also contain compatible conventional carriers, for example cream or 
ointment bases, and ethanol or oleyl alcohol for lotions. Such carriers may 
constitute from about 1% to about 98% by weight of the formulation; more 
usually they will constitute up to about 80% by weight of the formulation. 

For administration to mammals, and particularly humans, it is 
expected that the daily dosage level of the active agent will be from 0.01 
mg/kg to 10 mg/kg, typically around 1 mg/kg. The physician in any event 
. will determine the actual dosage whigh will be most suitable for an 
individual and will vary with the age, weight and response of the particular 
individual. The above dosages are exemplary of the av rage case. There 



wo 97/42337 



PCT/GB97/01209 



.35- 

can. of course, be individual instances where higher or lower dosage 
ranges are merited, and such are within the scope of this invention. 

A vaccine composition is conveniently in injectable form, or 
may be administered by other techniques, for example a gene gun using, 
5 for example, gold particles, or SK wVo Conventional adjuvants may be 
employed to enhance the immune response. 

A suitable unit dose for vaccination is 0.5-5^g/kg of antigen, and 
such dose is preferably administered 1-3 times and with an interval of 1-3 
weeks. 

10 With the indicated dose range, no adverse toxicological effects will 

be observed with the compounds of the invention which would preclude 
their administration to suitable individuals. 

The polynucleotide sequences, vectors and host cells of the present 
invention may be used in the treatment of infectious diseases including 
15 viral and bacterial infections such as HIV and TB, inflammatory diseases, 
cardiovascular diseases, rheumatoid arthritis, atherosclerosis, restinosis, 
cancer, for example cancer of the bowel, colon, breast or lung. 

Othgr UCR Patent? 

20 The published LCR most similar in targeting transgene 

expression to macrophages is the Class il LCR . This element has been 
claimed to direct expression to (presumably activated) macrophages, 
dendritic cells and B-lymphocytes (16). Our data with stably transfected 
murine macrophage line RAW 264.7 and the murine B-cell line A20 already 

25 show the CD68 cosmids direct expression which is restricted to 
macrophages (Figures 5 8 6). 
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Seq ID No. 1 

REVERSE -COMPLEMENT of: Drg 237. Con check: 1236 from: 1 to: 3601 
Drg237.rev Length: 3601 April 29, 1996 11.13 Type N Check: 6126 

1 TGTCCTGGAA CCCAGGTGCC TACCTGGTCT GCTGCATATT TGTTTTCTCT 

51 TCCAGCATGG AGATATGGnA CCAAAAGGAA CGAGTGCTCA GAGTTTTGAT 

101 TACCAnTGAC CTGCTGGTGA GTAGAGGGAA CTGATAGCAA AGGCAGAAGG 

151 GAGGATCCAA GGTGATTCCC TCTCCAAGGC AAGTTCGGAA AGTAGCAGCT 

201 TGGAATAGAA TCTGGCATGC CTAAGGCCTT TGGGGAACTG GGATGCTTAT 

2 51 TTCCTCTGCC TTCCTTGGCT GCCCACATGG ATGCCTAAGT GTCTTCCCTC 
301 CGGGATAGAG TGTCCTCCGT GCACATGCTG AAGAGTTGTC TTTCTTGACG 

3 51 TAGGCCAGAG GCATTGATGT GCAGCAGGTT TCTTTAGTCA TCAAcTATGA 

4 01 CCTTCCCACC AACAGGGAAA ACTATATCCA CAGGTAAGCG TAGATCTGGA 
451 ACATTCCCAn ACCCTTTCAC ACcTGGCCcT CCCTGGGCTT AAAGCTCCTG 
501 ATATTCCTCA TCCCCTTCCT TGTTTTCCAG AATCGGTCGA GGTGGACGGT 
551 TTGGCCGTAA AGGTGTGGCT ATTAACATGG TGACAGAAGA AGACAAGAGG 
601 AyTcTTCGAG ACATTGAGAC cTTCTACAAC ACCTCCATTG AGGAAATGCC 
651 CCTCAATGTT GCTGACCTCA TCTGAGGGGC TGTCcTGCCA CCCAsCCCCA 
701 gCCAsgGcTC AAkyTcTGGG GGCTGAGGAk CwgCAGGAGG GGGGAGGGAA 
751 GGGAGCCAAG GGATGGACAT CTTGTcAtTT TTTTTtCTTT GAATAAATGT 
801 CACTTTTTGA GGCAAAAGAA GG7VACCGTGA ACATTTTAGA CACCCTTTTC 
851 TTTGGGGTAG GCTCTTGCCC CAGGCGCCGG CTCTTCTCCC AAAAAAAAAA 
901 AAAAAACAcT AaTCCATTTC CCTAACcTAg TAACcTCCAG ATCCCAGAGG 
951 CTCTCCTCAC CTCAGCTGAG CTCCTTTGAA AGTGATTCAA GGGAcTATGT 

1001 CAcTCAgCcT CATTTGcTGG ACCAAATcTG GAGGGAGAAC CCCTAAAACC 

1051 CcTAAGTGAG GTTGCCCAGG GGGTTGTCCC CAGGTGGGGG GAAGCAGGGG 

1101 ■ AGAGAAAATG GTAGCCATTT TtACATTGTT TTGTATAGTA TTTATTGATT 

1151 CAGGAAACAA ACACAAAATT cTGAATAAAA TGACTTGGAA ACTGCCTGTT 

1201 TGGGCTTCTC ATTTCTtACC TCCCCTTCCC TCTCCCACCT GmTAcTGGGT 
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1251 


GCATcTCTGC 


T . CCCCCCTT 


. CCCCAGCAG 


ATGGTTACCT 


TTGGGCTGTT 


1301 


GCTTTCTTGT 


CACCATCTGA 


GTTCTCAGAC 


GCTGGAAAGC 


CATGTTCTCG 


1351 


GCTCTGTGAA 


tGACAATGcT 


GAcTGGAGTG 


CTGCCCCTCT 


GTAAAGGGcT 


1401 


GGGTGTGGAT 


GGTCACAAGC 


CCtTcACATG 


CyTCAGCCAA 


GAgGAAGTAG 


1451 


tACAGGggTC AGCCCAgAgg 


TCCAGGGGAA 


AgGAgtgGAA 


AcCGATTTCC 


1501 


CCACCAA.GG 


GAgGGGCcTG 


TACcTCAgcT 


GTTCCCATAg 


cTACTTGCCA 


1551 


CAACTGCCAA 


GCAAgTTTCG 


cTGAgTTtGA 


CACATGG.AT 


CCC.TGTGGA 


1601 


TCAAcTGCCC 


TAgGAcTCcG 


TTTGCACCCA 


TgtgACacTG 


ttGAcTTTGC 


1651 


CCTGAcgAa . 


gCAgggcCAa 


cagtccccta 


AcTTAATtAC 


aAaAAcTAAT 


1701 


GAcTAAGAgA 


gAgGTGGcTA 


gAgCTGAgGC 


CCCTG . AgTC 


AgGcTGTGGG 


1751 


TGGGATCaTc 


tCCAgTACAG 


GAAgtGAGAc 


TTTCATTT , C 


ctCCtTTcCA 


1801 


Ag . AgAgGGc 


TGAgGGAgCa 


gGGTTgAgCa 


ActGGTGCAg 


ACAgCCTAgc 


1851 


TGGActTTGG 


GTgAgGc . , g 


gtTCAgCCAT 


gAgGctGGcT 


gTgCttTtcT 


1901 


cGGGGGcCCT 


GcTGGGGcTA 


CTGGCAGGTA 


AGGAGGAAgG 


AgGcTGAGGG 


1951 


GAGGGGG. .C 


CCCTGGGAGG 


GAGCcTG.CC 


CTGGGTTGct 


AACCATcTCC 


2001 


T. . . .ct .CT 


GCCAAAAGCC 


CAGGGGACAG 


GGAATGAC . T 


GTCCTCACAA 


2051 


AAAATCAGcT 


ACTTtGcTGC 


CATCCtTCAC 


GGTGACACCC 


ACGGtTACAG 


2101 


AGAGCACTGG 


AACAAcCAGC 


CACAGGACTA 


CCAAgAGCCA 


CAAAACCACC 
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Seq ID No 2 
TTCCAAGAGAGGGCTGAGGGAGCAGGGTTGAGCAACTGG 

5 

TGCAGACAGCCTAGCTGGACTTTGGGTGAGGCGGTTCAGCCASiaAaTCC 
TnrTnc;GGCTA CTGGCAGataaaaaaCCCAG 
10 gaaggaggctgaggggagggggcccctgggagggagcctgccctggg 
ttgctaaccatctcctctctgccaaaagCCCAG 
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aattcggttctccaatcccctgggtcactttgctcttgtgcacgctttcc 
agtctttcagcgtaagccagagtcattcccaaggatgctggtttctctct 
gggggaagagctgctctgtgatggagcccatgcgtgtcatctgagcctct 
ggcttccctgccagtgcagccctggcagtgtcctacttcccagggctgtt 
gtctgcctggcgggaaggtcctgggcaaaggatcagtctttgtactctga 
gagcagactacttggctcctctctgttttttatcagcgaagttggatata 
tctctcccacatttccctaatcatatgctatatattggctttttttttct 

^_r;^.flhaa CC:CCCAAATACATCAAGATC TTTGTACTGGATGAAGCTGACGA 
AATGTTAAGCCGTGGATTCAAGGACC AGATCTATGACATATTCCATAAGC 
TCAACAGCAACACCCAGataaaaacaatcttqcttqaataQctaatqatt 
cttgaaaaatagtaagtgccaggggaaaccaaatactggattcttgagcc 
tttttatgcatctgcttcagttttaggtgtggctagggaagggagcaggc 
ctcaggaaggaaccagcactctaagactggcctttttttccactagGias 

TTTTGCTGTCAGCCACAATGCCTTCTGATGTGCTTGAGGTGACCAAGAAG 
TTCATGAGGGACCCCATTCGGATTr TTGTCAAGAAGGAAGAGTTGACCCT 
GGAGGGTATGCGCCAGT TCTACATCAACGTGGAACGAGAGqtqqqqccca 
gtgcaggaggcgggcctggtagtgagttgttgggtatagcccctgactga 

1- 1- M- ^ gi- r^r^r^r^^^ r r-r-agG AHTGGAAGCTGGACACACTATGTGACTT 
GTATGAAACCCTGACCA TrArCGAGGCAGTCATCTTCATCAACACCCGGA 
GGAAGGTGGACTGGCTCAGCGAGAA GATGGATGCTCGAGATTTCAGTGTA 
TCCGCCATG atatatttqcccqctqccaqcctqttqtgggtctqcccqtc 
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agaagtgtcctacttgaagccagggttcctggaacccaggtgcctacctg 

g^r■^g^!^goat-a^^t-g^^^-^-ct•f;t- 1 cragCATOGAGATATnnACCAAAAG 
OAACGAGACGTGATTATnAQnflAGTT TCGTTCTGGCTCTAGCAGAGTTTT 
GATTACCA CTGACCTGCTCgngagtagaaaaaactaataQcaaaaacaaa 
agggaggatccaaggtgattccctctccaaggggacatcagtgcctctca 
ggaaagtagcagcttggaatagaatctggcatgcctaaggcctttgggga 
actgggatgcttatttcctctgccttccttggctgcccacatggatgcct 
aagtgtcttccctccgggatagagtgtcctccgtgcacatgctgaagagt 
hgtcttitgtir-gacgtaaGCCAGAGGCATTGATGTGCAGCAGGTTTrTTTA 
GTCATCAAGTATGArCTTCCGACCIAACAGGGAAAACTATATCCACAGgtia 
agcgtagatctggaacaytcccnt accent tcacacctggccctccctgg 
QCttaaagctcctgatattcctcatccccttccttatt-ttccag AATCGG 
TGGAGGTGGAGGGTTTGGrrGTAAAGGTGTGGGTATTAACATGGTGACAG 
AAGAAGACAAGAGGANTCTTCGAGAgATTGAGACCTTCTACAArArrTrr 
ATTGAGGAAATGCCCCTCAATGTTGCTGACGTCATCTGAGGGG CTGTCCT 
GCCACCCASCGCCAGGCA GGGCTCAAAGTCTGGGGGCTGAGGACCTGCAG 
GAGGGGGGAGGGAAGGG AGGCAAGGG ATGGAGATGTTGTrATT TTTTTTT 
CTTTGAATAAATGTCArTTTTTGAGGCAAAAGAAGGAACCGTGAACATTT 
TAGACACCCTTTTCTTTGGGGTAGGCTCTTGCCCCAGGCGCCGGCTCTTC 
TCCCAAAAAAAAAAAAAA AACAGTAATC rATTTCCGTAArCTAGTAArPT 
CCAGATrCCAGAGGrTrTCrTCACCTGAGCTGAGC TCCTTTGAAAGTGAT 
TPAAGGGAgTATGTPArrr'AGrrTrATTTGrTGGArr'AAATrTGriaGr.r.a 
GAAGCrrT AAAAGrrCTAAGTGAGGTTGGrCAGGGGGTTGTCCCrAGGTG 
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GGGGCgAAGCAGGGnAGAOAAAATnGTAGCrATTTTTACATTnTTTTnTAT 

AGTATTTATTGATTrAGGAAACAAACACAAAATTCTGAATAAAATGArTT 

GGAAAGTGGCTG TTTGGGOTTGTCATTTCTTArrTrrrrTTrrrTrTrPr 

ACCTGCTACTGGGTGCATCTCTGCTCCCCCCTTCCCCAGCAGATGGTT 

ACCTTTGGGCTGTTGCTTTCTTGTCACCATCTGAGTTCTCAGACGCTGGA 

AAGCCATGTTCTCGGCTCTGTGAATGACAATGCTGACTGGAGTGCTGCC 

CCTCTGTAAAGGGCTGGGTGTGGATGGTCACAAGCCCCTCACATGCCTCA 

GCCAAGAGGAAGTAGTACAGGGGTCAGCCCAGAGGTCCAGGGGAAAGGAG 

TGGAAACCGATTTCCCCACCAAGGGAGGGGCCTGTACCTCAGCTGTTCC 

CATAGCTACTTGCCACAACTGCCAAGCAAGTTTCGCTGAGTTTGACACAT 

GGATCCCTGTGGATCAACTGCCCTAGGACTCCGTTTGCACCCATGTGA 

CACTGTTGACTTTGCCCTGACGAA . GCAGGGCCAACAGTCCCCTAACTTA 

ATTACAAAAACTAATGACTAAGAGAGAGGTGGCTAGAGCTGAGGCCCCTG 

AGTCAGGCTGTGGGTGGGATCATCTCCAGTACAGGAAGTGAGACTTTCA 

TTTCCTCCTTTCCAAGAGAGGGCTGAGGGAGCAGGGTTGAGCAACTGG 

TGCAGACAGCCTAGCTGGACTTTGGGTGAGGCGGTTCAGCCAIJSaGQC 

TGGCTGTGGTTTTCTCGGG GGrrCTGGTGGGGCTACTGGrAGgt aaggag 

gaaggaggctgaggggagggggcccctgggagggagcctgccctggg 

htgcil-aarirat-nfffrt-rl-rt-ggcaaaaa CCCAGGGGACAGGGAAT 

GACTGTCrT rArAAAAAATGAGnTACTTTGCTGCCATCCrrTrArGGTGA 

CACCCACGG TTArAGAGAGCArTGGAACAACCAGCCArAGGAPTArrAAG 

AGCCACAAAACrACrACTr APAGGACAACCACCACAGGCAgCArrAGr 
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rArnnArccArnArTr,cc;^ rTr&rAACcrcACCACCArcAGCCATGgA 

AArfiTrACArtTTrATrCA A f-AArirAATAGCACTGCCArCAGCCAGGGACC 

rTrAACTfiCCArTrAr.AGT c rTcrrACCArTAGTCATGGAAATGCCACGG 

TTrATrrAArAAnrAACAnrArTnrcAr rAGrrrAGGATTCACCAGTTCT 

r;rrrArccAr,AArrArcT rrArrrTrTrcGAGTCCTAGCCCAACCTCCA 

AnnAr,Ar-CA T Tr,r;ar;ArTArArGTGGAGCAATGGTTCCCAGCCCTGTGT 

rrArrTrrAArnCCCAGAT TrAGATTCGAGTCATGTACACAACCCAGGGTG 
GAGGAGAGa taaaqctaaaactcTQqqqatqagaggggagggaggcaggac 
tggttataggctcagagggaagaaggaagaggggacaggnaaccttggcc 
ggcatcgcatgcagtcttgtgaccttccagtctttaacttccgcagSSCI 

GGGGTATGTrTGTWGTGA NGrGGAACAGAAGGAAGGTCCAGGGAAGCTGT 

GGGGGTGCrrATrrGrArrTGCTTCT rTrATTrrrCTATGGACACCTCAG 

GTTTGGATTCATGCAG qtataqccatqacctcaqtctcacccctcactca 

gcctcccggcgcccctcccctcccaatcccacacgctactccttcctctg 

tggagagggataccacctgcgccttcctcttcgccccacagGACCTCC 

AGGAGAAGGTTGTCTACG TGAGrTACATGGGGGTGGAGTACAATGTGTC 

CTTCCCCCAGGCAGCACq taaqtaacctccttccctttctcattgctacc 

actagacgccagggttcctgaaaggactaagctggggccagggaggtgg 

ataggatctgacccttcctcactcctccagftGTGGACATTCTCGGCTCAG 

AATGrATPrrTTPnarSAT rTrrAAGCArrCCTGGGGCAGAGCTTCAGTTG 
GAGGAAGTrriAGGATCA TTrTTTCACrAGCTGTCCACCTCGACCTGCTCT 
. CGTTGAGGrTrrAGGrTnrTCAGGT ngrffCAGArAGGGGTCTTTGGGCAA 

^gtaagacctacctactccttccctcctagaatcctcccactgcactgaa 
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aaccccttccccaggcccataagccactcatctctcttcttaacccccca 

aatctcgctctcccagcttgtcatggctacagggcagctttctttccatc 

ctctacaagactctgccagtttcccccttttatcactgctgagtcactgc 

ggtgagctcctcaccaatctcctactccccagcatccccccattccctcc 

tcccacctttatcccaaccagcacgtcactgcaaatacctacctgcccta 

hfjnttrcgccag GTTTCTCCTGCCCC AriTflACCGGTCCATrTT GCTGCCT 

CTCATCATC GGCCTGATrCTTCTTGGCCTCCTCGCCCTGnTGCTTATTGP 

TTTCTGCATrATCCGGAGACGCCCATCCGCCTACCAGGCCCTCTGAGCAT 

TTGCTTCAA AGCCGAGGGCACTGAGGGGGTTGGGGTGTGGTGGGGGGGT 

AGGCTTATTTCrTCGAGA PGGAArTGGGTPAAAGACAATGTTATTTTrrT 

TGCGTTTGTTGAAGAACA AAAAGAAAGrCGGGCATGACGGrTCATGCrTG 

TAATCGCAGCACTTTGGQA GGGTGAGGCAGGTGGATCACTGGAGGTGAGG 

AQTTTGAGAGGAGCCTG GrrAArATGGTGAAACCCTGTCTCTACTAAAAA 

TACAATTAGGCAGGTGT GGGGGGGTAATrGGAGCTGGCCTGTAATGCCAG 

CTACTTGGGAGGCTGAGG rAGAAGTGCTTGAAGCCAGGAGGTGGAGGTTG 

CAGTGAGCGGTGATCGCG rGAGTGAGCCAAGAGTGGrGCCAGTGGArTrr 

AGCCTGGGC GAGAGAGCGAGACTGTCTCAAATAAATAAATATGAGATAAT 

GCAGTCGGGAGAAGGGAG GGAGAGAATTTTATTAAATGTGACGAACTGCC 

CCCCCCCCCCCCCCCCAGGAGGAGA GCAGCAAAATTTATGCAAATCTTTG 

ACGGGGTTTTCCTTGTCC TGrrAGGATTAAAAGCCATGAGTTTGTTGTrA 

CATGCCTTTOTATGCCTTrCATGGG TGGGTGTCAGGGAGCCGGAAGCAGr 

TGCTGAGGA GGGATGAAAATGTCAGTGTGTGACGATGGCTGATGGGTTrA 
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prrccCAAAnrrTnGCACA nrTnr^TnTTnnGTCTGCCGTGCCTCCCTTCC 

TTrrTrPTCTTnr,r,c;rrAr Tr;r;rTr,CTcrAr,TTCCCCATCCGTGGCAAGC 

rr,r,Tar,Ar,rrATTrATCCCC r -,rAr.rrTTrTTCCTGACCCTCGTACAGTTT 

rRAATGCAGrAr;ArAnC:rA&Af:irAATG AGTr,GGGGGrTGTriGAACTTCAT 

TrrrAAAGGrAGCGCCAG Tr!r!<-Tr:CTGAr;rAATGAGAATGTCCTGTCCTG 

TrrArGATATTrAAGGC ranrAGAAGAGGPrGATTAAACCCTCGCAGCGA 

rrTGGGATGGTCCTATrrrArCTGC AAGGGGTTGAATCAAGAAGGAGCAG 

TrinnTACTrTrtArGTGrArTGGGGGn TrGTGGGAACAGCATGCCCCCCAC 

ArGGGGrGACrTGCCAAnrrTAAr TTGATGCCGCCAGTACTTGAGATGAG 

rjAGTGTGArTrTGAGGA rAGrGAAGGTrCAGATTCTAGAAAGGACCTCCC 

AGATGGGGACAGGGTGCArrAGGAGTG AGrGrGAGTCGCACCCATTACAG 

rTGGrTAGGGCGCAATrrrTGGGAG rrAnrtATGAGCAGCACCCCCCAGCC 

GTAGGAGCrrrAGGAGGrTTrPGGGTT rrAAr;GGC|N4AGAGACTGCCCCAC 

AAGGSAGrrrTGACCTG nrAGnGrGPArirAAGrrCCACCTCTGCCTGCAG 

ArATPrGTRTGACCTTnTAGACTT TGGAGGGGGGCCCCAAAGGGCTGATC 

rArAGGGGAATGACGrAPGGGTGGn PArrGTGQGTTGGCGTCCCGGCGGT 

rr;GTAACGAAc;rACAAC rirprrGAGGAGGTAGTCCAAGGTGCCCTTTTCC 

rrAAGCAGrGGTGMAGG TTTrMATGTGGCCTCGCCGAAGTCTCTGCCCAGC 

ACGTGGTGrrrGTCGGn rrAGGGGGGAGGGGGCGAATCCCGGGACTGCTC 

rifGAGGPGTGGGGTPrrrGGAG ArTrTTGGGGGTSTGGGCCCCAAGGGTG 

ATTrAGGTGGTGGCrTTyrrGGG ArrTGGGATGCTTCCCCCACGTCTTCT 

TrpTgrTTTAATGTrcCGG rtrPgAGgAGTTGCCGGCGCAATTCATGYTGCG'- 

AGGrr!TGAGrrAAPGGG; \r.farr:Ar;ACAAr;rArAGGGGGCTGrGGGCAACG 
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CGGCACCTA AGnAGOCrTGCCCGQTGCAGACTCTCCT nrTrnnACCOnrr; 
CCrTTCCCTCTAGAGACGCTGAGAGAACGGGAGCTAGTAGrnCCrrrArr 
CAACGCrACCTCGGAGACTrCGGCTCCTTCTCTrTCAArTTr GAACAATA 
CAAAGTGTGCTAGGAGAAGACAAGATGGCGCCCAGGAGG AGGAGC-GGAGA 
AAGGrAGGGGTGTAAATCTGGCTTCCAAACTGGAAGC GTGAArAAAGGPG 
TGGGAGGTOTAAGGGGGGAGGCGTGCAGCTTCGGGAAGfTT 
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Sequence ID No 4 

ctagctg'gtctgagcatctctgcc ATGCGGCTCCCTGTGTGTCTGATCT TGCTAGGACCGC 

HfiJ^gtaaggagaaatgggaggtgggggagggagggctcatgggcaggagcctgcacctgggtggcc 

aaccaatctcttactgaaag CCCAAGGAACAGAGGAAGACTGTCCTCACAAAAAGGCCG 

TTACTCTCCTGCCATCCTTCACGATGACACCTACAGCCACAGAAAGCACAGCCA 

GCCCTACGACCAGCCACAGGCCCACCACCACCAGTCACGGAAATGTCACAGT 

TCACACCAGCTCCGGACCCACAACTGTCACTCATAACCCTGCCACCACCACC 

AGTCATGGGAATGCCACAATTTCTCATGCCACAGTTTCTCCCACCACAAATGGC 

ACTGCTACTAGTCCAAGATCCTCCACTGTTGGCCCTCACCCTGGACCACCTCC 

ACCCTCGCCTAGTCCAAGGTCCAAGGGGGCTCTTGGGAACTACACGTGGGCCA 

ACGGCTCCCATCCTTGTGTTCAGCTCCAAGCCCAAATTCAAATCCGAATCCTAT 

ACCCAATTCAGGGTGGAAGAAAG ataaaQCtaaaQtaaggcttaaaaagggnaagaggcaa 

gtcctgggctcgttcagcagggaagaggaagagaagaggaggggataaactggatggagcattcttgtgattt 

cagacccaccattgcacttctacag GCTTGGGGCATATCTGTTTTGAATCCCAACAAAACC 

AAGGTCCAGGGAGGTTGTGACGGTACCCATCCCCACCTGTCTCTCTCATTTCC 

TTATGGACAGCTTACCTTTGGATTCAAACAG Qtatacagcttgagmgtrtr.fatr.r.tr.tattrttp.r. 

atatcccatacctgtacccccggagcctctgttcttgctctgtggacatggatgcctctgtccctgatgccttgagtctttyt 

gttcaccttaag GACCTACATCAGAGCCCGAGTACAGTCTACCTGGACTACATGGCG 

GTGGAATACAATGTGTCCTTCCCACAGGCAGCAC gtgagtaatctcttctccttaccacacta 

aaagtctaggctgggcgtgctgggctggtggggaggactcaggagtcaggactggatttgactcttaattactaat 

tactgcag AGTGGACATTCATGGCGC AGAATTCA TCTCTTCGAGAGCTCCAAGnTC 

CCTTGGGCCAAAGCTTCTGCTGTGGAAATGCAAGCATAGTTCTTTCTCCAGnTG 

TTCACCTTGA CCTGCTCTCTCTAAGGCTACAGGCTGCTCAGCTGnnTGAnAAGG 

GACACTTCGGGCCAT gtaagccctacctacttcttctttcctagagctctcccagtgctctggaaaccttccc 
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cagaactctttttctagacctccgcctcctctacaagactgccttaWcccctttgctgactgctctcacctat^ 
tcctcagcctttcttcatcccctttcctcttccgtcctcccccacctcccacctttagcccgtccaccactgcagcaatt^^ 
cagggatcccattattctgccag GTTTCTCTT GCAACCGTGACCAGTCCCTCTTfiCTnnn 
TCTCATCATTGGCCTGGTCCTCCTCGGC CTCCTCAnCCTGGTGCTCATCGnn 
TTCTGCATCACCCGCAGACGACAATCA ACCTACCAGCCCCTCTGA gcatctgcccc 
agtccactgtg 
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CLAIMS 

1. A cloned polynucleotide having the function of a 
5 transcriptional regulatory sequence (trs) and comprising: 

(a) a polynucleotide fragment having at least 70% identity 
to the polynucleotide of Seq ID No. 2; 

(b) a polynucleotide which is complementary to the 
polynucleotide of (a); or 

10 (c) a polynucleotide comprising at least 1 5 sequential 

bases of the polynucleotide of (a) or (b). 

2. A polynucleotide according to claim 1 wherein the 
polynucleotide fragment has at least 80% identity to the polynucleotide of 
Seq ID No. 2. 

15 3. A polynucleotide according to claim 1 or 2 wherein the 

polynucleotide fragment has at least 90% identity to the polynucleotide of 
Seq ID No. 2. 

4. A polynucleotide according to claim 1 , 2 or 3 comprising the 

polynucleotide of Seq ID No. 2. 
20 5. A polynucleotide comprising the transcriptional regulatory 

sequence of CD68. 

6. An expression cassette comprising a polynucleotide 

according to any one of the preceding claims and a polynucleotide 
operatively linked thereto encoding a heterologous polypeptide. 
25 7. An expression vector comprising the expression cassette as 

claimed in claim 6. 

8. A host cell comprising^a vector as claimed in claim 7. 

9 A process for producing a polypeptide which process 

comprises transforming or transfecting a cell with a vector as claimed in 
30 claim 7 and culturing the transfomned or transfected cell. 
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10 A vector for the integration of an heterologous gene into the 

genome of a mammalian host cell such that the gene may be expressed in 
the host cell, the vector comprising a transcriptional regulatory sequence, 
5 the said gene and a Locus Control Region capable of eliciting host cell- 
type restricted, integration site independent, copy number dependent 
expression of said gene, characterised in that the Locus Control Region is 
located within a region extending from 14kb upstream to 25 kb 
downstream of the CD68 gene. 
]0 11 A vector as claimed in claim 10, wherein the Locus Control 

Region is located within a region extending from 5.5 kb upstream to 12 kb 
downstream of the CD68 gene. 

12 A vector as claimed in claim 1 1 , wherein the Locus Control 
Region is located within a 3 kb BstX1-BstX1 locus immediately upstream of 

15 the CD68 gene. 

13 A mammalian host cell selected from macrophages, 
monocytes and dendritic cells and their precursors, transformed with a 
vector as defined according to any one of claims 10 to 12. 

14 A mammalian host cell as claimed in claim 1 3 which is a 
20 mature macrophage. 

15 A method of producing a polypeptide, which method 
comprises culturing a mammalian host celt according to claim 13 or claim 
14. 

16 A method of modifying mammalian stem cells and progenitor 
25 ceils comprising transforming mammalian stem cells with a vector 

according to any one of claims 10 to 12. 

1 7 Mammaliaastem cells and progenitoc^cjgjls, modified as 
claimed in claim 16, for use in the treatment of a disease condition in a 
human or animal body. 

30 18 Use of a vector according to any one of claims 1 0 to 12 or 



BNSDOCID: <WO 9742337A1_L> 



wo 97/42337 



53 



PCT/GB97/01209 



mammalran stem cells or progenitor cells according to claim 17. for the 
manufacture of a medicament for the treatment of a disease condition of 
the human or animal body caused by a gene deficiency. 

19 An isolated polypeptide which is expressed under the control 
5 of a CD68 transcriptional regulatory sequence. 

20 An isolated polypeptide which is expressed under the control 
of a transcriptional regulatory sequence comprising a nucleotide sequence 
as claimed in any one of claims 1 to 5 

21 A polynucleotide as claimed in any one of claims 1 to 5, an 

10 expression cassette as claimed in claim 6, an expression vector as claimed 
in claim 7 or a host cell as claimed in claim 8 for use in medical therapy. 

22 Use of a polynucleotide as claimed in any one of claims 1 to 
5, an expression cassette as claimed In claim 6, an expression vector as 
claimed in claim 7 or a host cell as claimed in claim 8 in the manufacture of 

1 5 a medicament for use in therapy. 
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1 TGTCCTGGAA CCCAGGTGCC TACCTGGTCT GCTGCATATT TGTTTTCTCT 

51 TCCAGCATGG AGATATGGnA CCAAAAGGAA CGAGTGCTCA GAGTTTTGAT 

101 TACCAnTGAC CTGCTGGTGA GTAGAGGGAA CTGATAGCAA AGGCAGAAGG 

151 GAGGATCCAA GGTGATTCCC TCTCCAAGGC AAGTTCGGAA AGTAGCAGCT 

201 TGGAATAGAA TCTGGCATGC CTAAGGCCTT TGGGGAACTG GGATGCTTAT 

251 TTCCTCTGCC TTCCTTGGCT GCCCACATGG ATGCCTAAGT GTCTTCCCTC 

301 CGGGATAGAG TGTCCTCCGT GCACATGCTG AAGAGTTGTC TTTCTTGACG 

351 TAGGCCAGAG GCATTGATGT GCAGCAGGTT TCTTTAGTCA TCAAcTATGA 

401 CcTTCCCACC AACAGGGAAA ACTATATCCA CAGGTAAGCG TAGATCTGGA 

451 ACATTCCCAn ACCCTTTCAC ACcTGGCCcT CCCTGGGCTT AAAGCTCCTG 

501 ATATTCCTCA TCCCCTTCCT TGTTTTCCAG AATCGGTCGA GGTGGACGGT 

551 TTGGCCGTAA AGGTGTGGCT ATTAACATGG TGACAGAAGA AGACAAGAGG 

601 AyTcTTCGAG ACATTGAGAC cTTCTACAAC ACCTCCATTG AGGAAATGCC 

651 CCTCAATGTT GCTGACCTCA TCTGAGGGGC TGTCcTGCCA CCCAsCCCCA 

701 gCCAsgGcTC AAkyTcTGGG GGCTGAGGAk CwgCAGGAGG GGGGAGGGAA 

751 GGGAGCCAAG GGATGGACAT CTTGTcAtTT TTTTTtCTTT GAATAAATGT 

801 CACTTTTTGA GGCAAAAGAA GGAACCGTGA ACATTTTAGA CACCCTTTTC 

8 51 TTTGGGGTAG GCTCTTGCCC CAGGCGCCGG CTCTTCTCCC AAAAAAAAAA 

901 AAAAAACACT AaTCCATTTC CCTAACcTAg TAACcTCCAG ATCCCAGAGG 

951 CTCTCCTCAC CTCAGCTGAG CTCCTTTGAA AGTGATTCAA GGGAcTATGT 

1001 CAcTCAgCcT CATTTGcTGG ACCAAATcTG GAGGGAGAAC CCCTAAAACC 

1051 CcTAAGTGAG GTTGCCCAGG GGGTTGTCCC CAGGTGGGGG GAAGCAGGGG 

1101 AGAGAAAATG GTAGCCATTT TtACATTGTT TTGTATAGTA TTTATTGATT 

1151 CAGGAAACAA ACACAAAATT cTGAATAAAA TGACTTGGT^ ACTGCCTGTT 

1201 TGGGCTTCTC ATTTCTtACC TCCCCTTCCC TCTCCCACCT GmTAcTGGGT 

1251 GCATcTCTGC T,CCCCCCTT . CCCCAGCAG ATGGTTACCT TTGGGCTGTT 

1301 GCTTTCTTGT CACCATCTGA GTTCTCAGAC GCTGGAAAGC CATGTTCTCG 

1351 GCTCTGTGAA tGACAATGcT GAcTGGAGTG CTGCCCCTCT GTAAAGGGcT 

14 01 GGGTGTGGAT GGTCACAAGC CCtTcACATG CyTCAGCCAA GAgGAAGTAG 

14 51 tACAGGggTC AGCCCAgAgg TCCAGGGGAA AgGAgtgGAA AcCGATTTCC 

1501 CCACCAA.GG GAgGGGCcTG TACcTCAgcT GTTCCCATAg cTACTTGCCA 

FIG. 2(1) 
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1551 CAACTGCCAA GCAAgTTTCG cTGAgTTtGA CACATGG.AT CCC . TGTGGA 

1601 TCAAcTGCCc TAgGAcTCcG TTTGCACCCA TgtgACacTG ttGAcTTTGC 

1651 CCTGAcgAa. gCAgggcCAa cagtccccta AcTTAATtAC aAaAAcTAAT 

1701 GAcTAAGAgA gAgGTGGcTA gAgCTGAgGC CCCTG.AgTC AgGcTGTGGG 

1751 TGGGATCaTc tCCAgTACAG GAAgtGAGAc TTTCATTT.C ctCCtTTcCA 

1801 Ag-AgAgGGc TGAgGGAgCa gGGTTgAgCa ActGGTGCAg ACAgCCTAgc 



1851 TGGActTTGG GTgAgGc . . g gtTCAg jCCAT gAgGctGGj cT gTgCttTtcT 

— — — — ^— — — ^— — — 

1901 cGGGGGcCCT GcTGGGGcTA cTGGCAGSTA AGGAGGAAgG AgGcTGAGGG 



1951 GAGGGGG..C CCCTGGGAGG GAGCcTG.CC CTGGGTTGct AACCATcTCC 
2001 T. . . .Ct.CT GCCAAAAcj^ CAGGGGACAG GGAATGAC.T GTCCTCACAA 



2051 AAAATCAGcT ACTTtGcTGC CATCCtTCAC GGTGACACCC ACGGtTACAG 
2101 AGAGCACTGG AACAAcCAGC CACAGGACTA CCAAgAGCCA CAAAACCACC 



FIG. 2(11) 
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